SORT: An Interactive Source-Rewriting Tool for Improved Translation

نویسندگان

  • Shachar Mirkin
  • Sriram Venkatapathy
  • Marc Dymetman
  • Ioan Calapodescu
چکیده

The quality of automatic translation is affected by many factors. One is the divergence between the specific source and target languages. Another lies in the source text itself, as some texts are more complex than others. One way to handle such texts is to modify them prior to translation. Yet, an important factor that is often overlooked is the source translatability with respect to the specific translation system and the specific model that are being used. In this paper we present an interactive system where source modifications are induced by confidence estimates that are derived from the translation model in use. Modifications are automatically generated and proposed for the user’s approval. Such a system can reduce postediting effort, replacing it by cost-effective pre-editing that can be done by monolinguals.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Confidence-driven Rewriting for Improved Translation

Some source texts are more difficult to translate than others. One way to handle such texts is to modify them prior to translation. Yet, a prominent factor that is often overlooked is the source translatability with respect to the specific translation system and the specific model that are being used. We present an approach, and an interactive tool implementing it, where source sentences are re...

متن کامل

An Interactive Rewriting Tool for Machine Acceptable Sentences

This paper proposes an interactive rewriting tool for supporting people in creating machine acceptable sentences. The experiment and evaluation of this tool conducted when applied to a pre-editing support tool in a Japanese-to-English machine translation (henceforth, MT) system are also described.

متن کامل

Sentential Paraphrasing as Black-Box Machine Translation

We present a simple, prepackaged solution to generating paraphrases of English sentences. We use the Paraphrase Database (PPDB) for monolingual sentence rewriting and provide machine translation language packs: prepackaged, tuned models that can be downloaded and used to generate paraphrases on a standard Unix environment. The language packs can be treated as a black box or customized to specif...

متن کامل

Automatic Rewriting for Controlled Language Translation

This paper summarizes the goals of controlled language authoring for machine translation, and discusses why automatic rewriting is an important research area for controlled authoring 1. We present the KANT controlled language rewriting architecture , which combines a rewriting engine with interactive dialogue. We also present some rewriting examples which are based on the results of an empirica...

متن کامل

An open-source web-based tool for resource-agnostic interactive translation prediction

We present a web-based open-source tool for interactive translation prediction (ITP) and describe its underlying architecture. ITP systems assist human translators by making context-based computer-generated suggestions as they type. Most of the ITP systems in literature are strongly coupled with a statistical machine translation system that is conveniently adapted to provide the suggestions. Ou...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013